EXISTENCE OF POSITIVE DEFINITE NONCOERCIVE SUMS OF SQUARES 

IN M[a;i,...,x„] 



GREGORY C. VERCHOTA 



Abstract. Positive definite forms / S IR[ 3^1, . . . , Xn] which are sums of squares of 
forms of Mfzi, . . . , Xn] are constructed to have the additional property that the members 
of any collection of forms whose squares sum to / must share a nontrivial complex root in 
C". 



1. Introduction 

Let / e M.[xi, . . . , Xn] be a form, i.e. homogeneous polynomial. Suppose / is a sum 
of squares (sos) of forms in M.[xi, . . . , Xn] and is positive definite (pd), /(a) > for all 
a e M" \ {0}. Writing f — J2Pj '^his is equivalent to saying that the forms pj share no 
common nontrivial real root from M". 



(1.1) 

Suppose a positive definite /orm / has at least one sos representation. Does f necessar- 
ily have a representation f = with G Mfxi, . . . , Xn] and the qk sharing no 
common complex root from C" \ {0}? 

For example, 

(i) the positive semi-definite (psd) x\ = p^ ^ R[xi, 2:2, xz] is uniquely represented as an 
sos, and p(0, 1, i) = 0; 

(ii) x\+ x\ E R[xi, X2] is pd with xi and X2 sharing no common nontrivial complex root; 

(iii) / = (xf + X2)'^ = p^ is pd with the quadratic form p having the root (1, i) G C^. 
But also / ~ {xD^ + {y/2xiX2)^ + (xl)^ or {x^ — x^Y + {2x1X2^ ™d in each case the 
quadratic forms now share no common nontrivial complex root. 

Though not the subject of this article, the study of boundary value problems for elliptic 
partial differential equations (PDE) motivates question ( ll.lt . Denote by 5 = (9i , . . . , 9„) = 

. . . , ^f-) the vector of first partial derivatives for M". Let a G Wq denote a multi- 
index. Define |q;| = ai + • • • + q;„ and d" = d"^ ■ ■ ■ d"". 



A theorem of N. Aronszajn and K. T. Smith | Agm65 1 may be stated as 
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Let pi, . . . ,pr € M[a:;i, . . . be forms of degree d. Let VL C M" be a bounded open 
connected set with suitably regular boundary and let SI be its closure. Then the integro- 
differential quadratic form 

(1.2) V / \p,{d)u\^dx 

is coercive over all functions u which have continuous partial derivatives of order d in VL 
that extend continuously to if and only if the system 

Pi = P2 = ■ ■ ■ = Pr ^ 

has no solution a S C" \ {0}. 

For ( 11.21) to be coercive over the collection of functions u it is required, by definition, 
that there be constants C > and cq G M independent of the functions u so that 

(1.3) V / \pj{d)ufdx>C [ V |a"updx-co / \u\^dx 

for all u in the collection. Once this estimate is obtained various elliptic boundary value 
problems can be solved. 

The Aronszajn-Smith theorem gives a precise algebraic characterization of all integro- 
differential forms ( 11.2b for which the coercive estimate ( 11.3b can hold. The integro-differential 
forms (11.21 ) are termed formally positive because of their sos shape. S. Agmon | |Agm58| 
improved this result by proving a necessary and sufficient (and more complicated) alge- 
braic condition on all integro-differential forms 

(1.4) XI H / CLapd^vWudx 

\a\<d\!3\<d-'^ 

not only the formally positive, that give rise to self-adjoint Unear properly elliptic differen- 
tial operators 

(1.5) L{d) =Y.J2 

|a|<ci |/3|<d 

and their regular boundary value problems | Agm5 8 1 1 Agm60) . When G M and the 



integro-differential form is formally positive, L corresponds to a polynomial / of degree 
2d that is a sum of squares. 

With his algebraic characterization Agmon solved completely the coerciveness prob- 
lem for integro-differential forms in the theory of linear PDE. However, the coerciveness 
problem for linear differential operators L{d) — X]|a|<2d '^ad" has not been solved. This 
problem can be stated in a way that leads back to the question about sums of squares in 

R[xi, . . .,Xn]- 

Instead of the integro-differential form one begins with the homogeneous constant co- 
efficient operator in M" 

L{d)= 

\a\=2d 

Ua E K. These will be self-adjoint. Suppose L is elliptic (equivalent to properly elliptic in 
this setting) L(^) > for all ^ e K" \ {0}. In general L can be rewritten an infinity of 
ways in the shape ( |1.5b 

(1.6) L{d)^ J2 ^c^pd^^^ 

|a|=|/3|=d 
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and therefore admits an infinity of integro-differential forms ilAi . Is there any choice of 
rewriting ( 11.61 ) that yields a coercive estimate? 

This fundamental question is broader than what can be answered here. Instead the 
question will be specialized to the setting of the Aronszajn-Smith theorem. 

Suppose it is further known that the homogeneous differential operator is an sos, L{d) = 
J^Pji^)- Then the theorem provides the necessary and sufficient algebraic condition for 
the integro-differential form ( 11.2b to be coercive ( 11.3b . If the form were to fail the algebraic 
condition and thus fail to be coercive is there another way to write the differential operator 
L as a sum of squares and thereby use the theorem again to obtain the coercive estimate 
for a new integro-differential form associated to L and thus solve boundary value problems 
for L7 This is question dl.ll) . 

All the results and proofs of this article are independent of these PDE considerations. 
Some more will be said about PDE in the last section. 

Definition 1.1. / G M.[xi, . . . , Xn] is called a sum of squares (an sos) if there exist poly- 
nomials pi, . . . ,pr £ M.[xi , . . . , Xn] so that / has the representation / = X]j^=i Pj 

Definition 1.2. An sos f G M[a;i, . . . , Xn] is called coercive or a coercive sum of squares 
if there exists a representation 

r 

(1-7) /-Ep| 

with pi, . . . ,pr G M.[xi, . . . , Xn] such that there are no solutions a G C" \ {0} to the 
system 

(1.8) pi = ---=pr = 

When such an / is homogeneous it is also called a coercive form. 
To be clear 

Definition 1.3. An sos f G M[xi, . . . , Xn] is called noncoercive or a noncoercive sos if 
there exists a representation (11.7b for / and if every such representation has a nontrivial 
solution in C" to the corresponding system dl.Sb . 

Question (II. lb asks if every positive definite sos is coercive. The aim of this article is to 
establish, by construction, the existence of positive definite noncoercive sums of squares. 
That this can be done is related to the well known fact that not every positive definite 
polynomial is a sum of squares. 

If every pd polynomial were an sos the answer to question dl.lb would be yes. This 
follows because positive definiteness of / allows 

(1.9) f^[f-e{xf + --- + xlf)]+e{x\'' + ---) 

with the bracketed term pd for e > small enough. When the bracketed term is an sos, 
(11.9b is an sos representation for / that satisfies the definition of coercive sos. 

We adopt standard notations for psd homogeneous polynomials IICL78I IIBCR98I p. 1 1 1 . 
Pn.d denotes the set of / G R[a;i, . . . , a;„] homogeneous of degree d that are nonnegative 
on R". En d denotes the set of all / G Pn,d that are sos. These sets are nonempty only 
when d is an even number. 

For the remainder of this article all polynomials will be homoge- 
neous POLYNOMIALS, OR FORMS. 

(Homogenization can be used for other statements.) 
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The argument given above together with Hilbert's results on positive polynomials that 
are sos IIHil88l . MRezOTI immediately yields the Theorem 

(1.10) 

If n < 2 and d is an even natural number, or if d = 2 and n is a natural number, or if 
{n, d) = (3, 4), then every pdform of Pn,d o. coercive sum of squares. 
The result of Hilbert | |R^93| , OSwaOOl . ORudOOl . llP504ll . llPROOl used here is that P3,4 = 



E3 4, while P2,2p = ^2,2p and P„,2 = 5]„_2 are elementary. See IBCR98 | pp. 11 1-1 12. 

Hilbert further proved that in every other case E„ 2p is a proper subset of P„,2p, elim- 
inating the argument based on ( |1.9t . It was T. S. Motzkin IIMot671 who first published 
explicit examples of positive semi-definite polynomials that were not sos. There are now 
various examples of these, e.g. IIRob73l . IICCTSI . 0CL77I . IILL781 : see ORezOOl formore. We 
found two of these to be very useful for the purpose here. Both are of Motzkin type and 
due to M. D. Choi and T. Y. Lam. 



(1.11) <l{w^ x,y, z) = w'* + a;^y^ + j/^z^ + z^x^ — Awxyz 
and 

/ ^ 42,42,42 o222 

s[x, y, z) — X y + y z + z x — Sx y z 

Both are nonnegative (psd) by the arithmetic-geometric mean inequality and neither is an 
sos. Thus q g P4^4 \ i;4_4 and s £ P^^ \ Sa^g- 
For T] > define 

(1.12) q^ = q + rj{x^ + y^ + z^) 

= s + r]{x^ + / + z^) 

For T] > 0, q^f and are pd. As long as r/ is small enough each is not an sos. This follows 
by an elementary topological argument first given by R. M. Robinson IIRob73l pp. 267-268 
which, moreover, shows the sets E to be topologically closed sets. It is also true that for 
all 7] large enough q,, and s,, are sos. See, for example, p. 269 of IIRob73l (in the case of 
q^i it can be verified that the term in q obviates the need to add rjw'^). Consequently for 
each polynomial there is a smallest value of 77, 770 > 0, that makes q,^ or s,, sos (cf. also 
the proof of Corollary 5.6 |CLR95| p. 122). In Section[3]it is shown for the quartic q that 
the square root of this value is the smallest positive root of — + ^ — 0, and that 

(1.13) q,^„{w,x,y,z) = {w^ - ^/t^{x^ + + z^) f + 

2 

■7^—=[{i\/mwx - yzf + {Sy^wy - zxf + [?,y^wz - xyf] 

In addition, it is proved that there is exactly one Gram matrix (or Gramian ||Gel89l ) that 
represents the polynomial (7,,^ . This means that every other sos representation for q^-j^ is 
merely a sum of squares of quadratics that are linear combinations of the quadratics of 
( II.I3I 1. Thus any common complex roots must be the same among all representations. 

The Gram matrix method of Choi, Lam and B. Reznick IICLR95L used for studying 
SOS representations of polynomials, is put into a tensor setting in Section |2l Every form 
of degree 2p is nonuniquely represented by a symmetric matrix (rank-2 symmetric tensor) 
acting as a quadratic form on the vector space of rank-p symmetric tensors. These are 
termed representation matrices for the form. The Gram matrices are those representation 
matrices that are psd, necessary and sufficient for an sos representation. 
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The polynomial (11.131 1 provides an example of a positive definite quartic with a unique 
Gram matrix. A positive definite sextic with a unique Gram matrix has previously been 
identified by Reznick in IIPra06ll . It is like the ones that will be constructed in Section |5] 
from the Sri- 

However wonderful it is, q^jg is coercive. It is proved in Section|4]that 
(1.14) {u'^ + v'^ + vw)'^ + qrfg{w,x,y, z) 

is positive definite and noncoercive in S6,4- In effect the uniqueness of representation of 
( 11.13b and the presence of the monomial vw forces a uniqueness of representation upon 
( 11.14b . while (1, i, 0, 0, 0, 0) is a solution to the corresponding system of quadratic equa- 
tions (11.8b . It follows from the definition of coercive sos that any form / e K[a;i, . . . , Xn] 
of even degree d such that / + xf^^i is a coercive sos must itself be a coercive sos. Con- 
sequently monomials be added to (11.14b preserving all required properties 
and the following theorem and partial answer to question (II. lb is obtained. 

Theorem 1.4. For n > 6, S„,4 contains polynomials that are positive definite and nonco- 
ercive. 

Theorem 1 1.41 is really a statement about certain cones of polynomials. After a scaling 
(11.13b can be rewritten 

(1.15) 

ai{x\--i{x\+x\+x'l)f +a2{xiX2~XzXi)'^ +a'i{xiXr^-XiX2Y +ai{xiXi-X2X'iY 

where it happens that for all values of 7, < 7 < | and all positive ai , . . . , 04, the forms 
( 11.15b are pd with a unique Gram matrices. 

Corollary 1.5. Forn > 6 there exist nonempty collections of quadratic fonns {pi, . . . ,Pr} C 
M[a;i, . . . , Xn] so that there exist no nontrivial solutions from R" to the systems pi — P2 — 
■ ■ ■ — Pr — 0, and so that every f — o.jPj' ^ith positive coefficients ai , . . . , Or, is a 
noncoercive sos. 

The Choi-Lam sextic form s (II. lib possesses more structure than its quartic counterpart 
q. First it is an even form. A form / is even if it is also a polynomial 
Second it is symmetric. A form / is symmetric if for every permutation cr on n objects 
/(x) = /((t(x)). The construction (11.12b of the forms preserves both of these proper- 
ties. In Section|5] for Sjj{x, y, z) with a unique Gram matrix, it is proved that when x"^ is 
replaced with + x^ the resulting form is pd and noncoercive. 

Theorem 1.6. For n> A, S„ g contains polynomials that are positive definite and nonco- 
ercive. 

The additional structure provided by the non-sos s seems to be the reason Theorem 
1 1.61 comes closer than Theorem 1 1.41 to being a complete result. As remarked on p. 263 of 
BRezOOi and in |Har 99|, in any dimension every psd even symmetric quartic form is an 
sos. Further, the replacement of x"^ with + x"^ that works in the sextic construction 
seems to rely more on the even property than it does on symmetry. It turns out that every 
psd even quartic form in n = 4 or fewer variables is a sum of squares. This follows from 
results of P. H. Diananda IIDia62ll . Thus constructing a quartic noncoercive sos for n = 5 
from an even form in 4 variables in a way analogous to the sextic case is not possible. On 
the other hand the Horn form |HN63 1 pp. 334-335 MDia62l p.25 IIRezOOl p.260 provides a 
psd even quartic form for n = 5 that is not an sos. See IICL78II pp. 394-396. 
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Between the coercive Theorem (fTTOl l and the noncoercive Theorems 11.41 and 11.61 di- 
mensions 4 and 5 for the former and 3 for the latter remain obscure. This puzzle will be 
discussed further in Section|6] 

2. A MULTILINEAR SETUP 

At first let , . . . , e" and ei , . . . , e„ be the standard (contravariant and covariant) basis 
vectors for M". The scalar product of vector and covector is denoted x • u = ^ XjU^ 
where xi, . . . , . . . are the standard coordinates of x and u. The nonnegative integers are 
denoted No. For a multi-index a e No its order is\a\ — a + - ■ • + «„, and a! = ai! • • • a„! 
For X e R", x" = a;^i • • • a;^" . 

The (contravariant) tensors t of rank p are multilinear (p-linear) forms mapping p vec- 
tors of R" to K by 

t • x^x^ • • • xP = V X? • • • X? t^^i^-^p 

The coordinates of t are P^"'^p and are obtained by t • e^^ • • • e-'p = P^"'^p. See BvdWTOI 
pp.74-75, 80-81. 

Given p (co)vectors Ui , . . . , Up a tensor t of rank p may be defined by the tensor product 

t = Ui (g) • • • (g) Up 

which acts multilinearly as 

(2.1) t • X^X^ . . - xP = (X^ • Ui)(x2 • U2) • • • (xf • Up) 

sothatt-Ji -^p ^uf ■■■ul". 

The collection of tensors of rank p, TP(R"), forms a vector space over R of dimension 
nP with standard basis 

Let (3p denote the symmetric group of all permutations of p objects. For each a e (3p the 
map 

defines a permutation of the basis vectors of T^'(R") and thereby induces a (unique) linear 
isomorphism on TP(R") ||Yok92| p.43. If P„{t) = t for all a € &p, then t is called a 
symmetric tensor. The set of all symmetric tensors of rankp, 5p(R"), also forms a vector 
space over R. The linear operator 

Sym = Synip ^ y^ P^ 

is a projection from T?'(R") onto S'p(R") so that 

(2.2) {5'ym(ej, <»■■■<» J : 1 < ji <■■■< jp < n} 
forms a basis for ^^(R"). Further, 

(2.3) dim{SP{W')) = 
IIYok92ll pp.47-48. 

Given indices ji < ■ ■ ■ < jp as in ( 12.21 ) let ak equal the number of indices equal to k 
for 1 < fc < n. In this way the multi-indices a G Nq of order p are put in one-to-one 
correspondence with the basis elements of S'^'(R"). Denote 

(2.4) = Sym{ej, • • • (g) e^- J 



n + p - 
P 
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for each basis element in (12.2b where a corresponds to ji < • • • < jp. 

The coordinates of Eq, (as a tensor in T^(R")j E'^'"^'^ are either or and sum to 1. 

Example 2.1. (i) For p — 2, E(2,o....o) — ei ® ei with E^^q, o) ~ -'^ '^^^^ nonzero 
coordinate. 

E(i,i,o...,o) = 5(ei (8i e2 + 62 (g) ei) with i_o....o) = = 1 ""^y nonzero 

coordinates. 

Thus {Eq : |q;| = 2} is identified with an orthogonal basis for the n x n symmetric 
matrices under the Hilbert-Schmidt inner product, 
(ii) Forp = 3, E(3^o....o) = ei (g) ei (g) ei. 

E(2,i,o...,o) = |(ei (g) Gi (g 62 + ei (g 62 (g ei + 62 g) ei (g ei). 

E(l,l,1^0...,0) = ^(ei g) 62 (g 63 + 63 (g 61 (g 62 + 62 g) 63 6i + 61 (g 63 (g 62 + 62 (g 
61 g) 63 + 63 g) 62 g) 61). 

For a vector x G M" (or C") each basis element E^ S ^^(M") therefore acts multilin- 
early on x as 

(2.5) E„ • XX • • • X = x" 
Therefore 

The vector space S'^'(R") is isomorphic to the vector space of homogeneous polynomials 
of degree p from R[xi, . . . , 

See, for example. Theorem 2.5 p. 67 of B Yok921 . 

In the same way the vector space of (covariant) tensors Tp (R" ) dual to (M" ) ( MYok92| . 
pp.53-54) is formed. Putting s = x^ g) • • • g) x?" e Tp{R'^), (lOT i can be rewritten as the 
dual pairing 

(2.6) t • s = (x^ • ui)(x2 • U2) • • • (xP • Up) 

A basis for the (covariant) symmetric tensors 5p(R") is defined similarly to (I2.2l l. and 
basis elements E", \a\ — p, are defined as in (12.4b . By the normalizations 

(2.7) N„ = \ and N" = J ^E" 

V a! V a! 

one obtains dual bases 

(2.8) N„ • N'^ - 6^ 

where the Dirac delta is equal to when a ^ /3 and 1 otherwise. 

Because these dual symmetric spaces are isomorphic, no longer will any distinction be 
made between them. Instead S'p(R") will be considered an inner product space with inner 
product formed as in ( 12.6b . Bases will be written {Eq : la] = p}, {N^ : \a\ = p} 
an orthogonal and an orthonormal basis respectively. Vectors of R" will be enumer- 
ated x^, x^, . . . , u^, . . . with subscripts indicating coordinates x = (xi, X2, . . .), x^ ~ 
(x}, X2, ...),... 

A convenient notation for the tensor product of p identical vectors is 



(2.9) 



x^P = xg).--g)xe S'P(R") 



g 
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When X 7^ the tensor x'^p will be referred to as a rank-one tensor even though it is an 
element of 5'*'(R"). For example, when p = 2 all n x n symmetric matrices that have rank 
1 are given by x*^^ = x (K) x. Now ( I2.5l l becomes 

E„ -x^P =x", \a\ =p. 

Since S'p(IR") is a real vector space, the foregoing can be done with it in place of R". Of 

particular interest is the space S'^(5p(K")) isomorphic to the space of ( 

j real symmetric matrices. These matrices will be referred to below as the 

representation matrices. 

Given any t e SP{R") the notation of will be applied as t^^ = t t e 
S'^{SP{W-)). Given also s, we introduce the notation 

S(g)st=S®t+t(g)S 

noting that 

t ®s t = 2t t 

and 

(S + 2 ^ S»2 _^ S t + 

A basis for the vector space S'^(5^'(R")) is 
(2.10) {E„ ®, E/3 : |a| = = p} 



It contains 



elements. More general elements of S"^ {S^ {W"-)) 



V 2 / 

will be denoted in script as with SorQ. All act as symmetric bihnear (quadratic) forms on 



S ■ st ~ S ■ ts 

For example 



(g>s E^ • E^E^ = 6tS^ + 6^6^ = 0, 1 or 2 



and in particular 



(2.11) 1e„«..E,.x-x-=x-^ 

By choosing a linear ordering for the multi-indices of order p, an isomorphism of 
5^ (5''' (M")) and the (^^^^ ^^^(^"^p symmetric matrices can be made 

explicit. Given ( 12. lib the one that is apparently most computationally convenient is in- 
duced by the mapping 



(2.12) E„ ®, E^ (<5^,5^ + 



\^\ = \uj\=p 



In this way an element of S'^(5p(R")) is assigned a representation matrix and vice versa. 
For example, with hnear order a ^ /3 -<•••, the tensor {aa^a + apE^ + ■ ■ ■ — 
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/ a'i aaap ■ ■ 



a^E^^ + aaCLpEja 0^ E^j + • • • is assigned the matrix 



a^ap al ■■■ I and by 



( 12.1 11 1 represents the form a^x^" + 2aaaf3'^°'^^ + • • • = (flax" + apyi 

A tensor of S'^(S'^(R")) and its representation matrix will be denoted by the same 
symbol. 

In addition ( 12.1 11 1 shows that 

Every element of S'^(S'p(M")) represents a homogeneous polynomial in R[xi, . . . , a;„] of 
degree 2p, and every such homogeneous polynomial can be represented by an element of 
S'2(S'P(R")). 

Such representations are not unique. S"^ {S^ {M.")) is not isomorphic to S'^p(R"). The 
respective dimensions are related by 



(2.13) 



P 



> 



n + 2p-l 
2p 



\ 2 / 

The following can be found on p.l09 of IICLR95L 

The subspace 

(2.14) A^'P'" = { A e S'2(5f (M")) : A • x^^x^p = for every x e M"} 

has as its dimension the difference of the two numbers in ( 12.131 ). 

To see this, the basis ( 12.101 ) for S'^(S'p(R")) can be partitioned into classes 

{E„ (g), E^i : a + /3 = 7} 

for each I7I = 2p, with the number of classes equal to dim(S'^P(R")). Beginning with a 
distinguished member of a class, the same span is obtained by the collection 

(2.15) {Ea ®s E^, E„ ®s E/3 - E„' ®s E/j-, E„ ®s E/3 - ^a" ®s E/j", . . .} 

where a + (3 — a' + (3' — ■ ■ ■ ~ ^j. Every element after the first is in the subspace A^.p.n 
By the definition of A^ p ", 

Two representation matrices for the same homogeneous polynomial of degree 2p always 
differ by a member of ^4^'^'". 

The members of the subspace ^4^'^'" when added to a representation matrix for a poly- 
nomial change the representation of the polynomial and do not change the polynomial. 
When a polynomial has an sos representation, adding what will be called a change A to 
that representation might or might not yield another sos representation. In the case it does 
yield another, it cannot alter the facts that the polynomials of degree p that are squared 
share or do not share a common real root. That they share or do not share a common 
complex root from C" \ CR", however, possibly can be altered by adding a A. Here 
CM" = {ax : a e C and X e R"}. 
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Example 2.2. A = E(2,o) ®s E(o,2) — '^^fii) "^^Y be allowed to serve as the only basis 

element for A^'^'^. Letting a e K 

(2.16) 

Sa := (E(2,o) + E(o,2))^' + aA - E(2,o) + (1 + a)E(2^o) ®s E(o^2) + E(o^2) - 2aEf^'i) 

when applied to x'^^x'*^ always yield the pd polynomial {xf + x^Y . Choosing a linear 
order (2, 0) -< (0, 2) -< (1, 1) for the basis elements of ^'^(E^), the isomorphism (12.121) . 
of ^^(^^(M^)) with the symmetiic 3x3 matrices, yields 

Sa - 

The eigenvalues are —a, 2 + a and —2a. Using these together with the corresponding 
unit eigenvectors suggests that ( 12.161 ) be written 

Sa = --(E(2,0) - E(o,2))'*^ H 1— (E(2,o) + E(o,2))'*^ - "^a^jfi^i) 

The representation matrix is psd if and only if —2 < a < if and only if 

is an SOS representaion. Among these, each quadratic term has the complex root x = (1, z) 
when a = 0, while there are no common complex roots when — 2 < a < 0. 

This example used the fact that a real symmetric m x m matrix may be written as an 
element of ^^(R™) 

m 

(2.17) ^AjU^^u^' 

j=i 

where the Xj are eigenvalues counted by multiplicity and G are the corresponding 
unit eigenvectors. 

The following proposition can be found in IICLR95I p. 106, Proposition 2.3. We include 
a proof in the multilinear language used here. 

Proposition 2.3. A form f £ M.[xi, . . . , Xn] of degree 2p is an sos if and only if there is a 
psd representation matrix Q such that f{x) — Q ■ x'^^'x'*''. 

Proof. When Q is psd and a representation matrix for /, then Q can be written as a ma- 
trix X]|/3|=p •^/3u'^ ® vl^ where the u'' are the unit eigenvectors with P ^ ^ j-g^j 

components for \a\ = p and A/j > are the corresponding eigenvalues. By the isomor- 
phism ( irni l it is a tensor g = E|/3|=p V(E|a|=p K^c,)®'^ that acts as g ■ ^®p^®p = 
E|/3|=p MT.\c.\=pui^°'f . Thus /is SOS. 
If / is SOS, then it is a sum of forms 



.\a\=p I \|a|=P / \|a|=P 



Oq e M. g can be taken to be a sum of tensors ( X)|a|=p '^"Eq j each with a psd 
representation matrix. □ 
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A psd representation matrix Q G 5*^(5^(11")) is also called a Gram matrix. For a 
form f of degree 2p to be an sos it is necessary and sufficient that it have a representation 
f[x) = Q ■ x'^Px'^P for some Gram matrix Q. 

An element of S'-^(5p(R")) may also be viewed as a linear transformation t i-^ 5t on 
S'P(R")sothat5-st = s-5t. 

Two more elementary but useful observations follow from the characterization of sums 
of squares given by Proposition 12.31 and elementary properties of psd matrices. 

Suppose Q is a Gram matrix. Then the form Q ■ sc^P-j^^P is positive definite if and only 
if the tensor (Q + A)x'*'' ^ ^ for all nonzero x S M" and for all changes A. 

For X, y e M" put z 

^ 5?/to(x^(P"2) ® y ® y) + . . . + i(p S'ym(x®(P-i) (g) y) ) 

Re z®P + Urn x®^ 

A linear transformation on 5p(R") is extended to complex valued tensors by 5(s + it) — 
Ss + iSt. It follows that A • z^^z^p = for all changes A. This is because the coefficients 
on the powers of the real variable t in A • (x + ty)®P{K + ty)®^ = must all vanish. The 
same coefficients occur on the unreduced powers of i in A • tP^tPp . Or one can invoke the 
multi-index formalism. Similarly, by comparing coefficients between binomial expansions, 
(12.51 1 extends to complex rank-one tensors 

E„ • z®P = z" 

(2.18) 

Let S be a representation matrix. Then S ■ x'^^x'^^ is a coercive sos if and only if 
there exists a A such that 5 + A /i a Gram matrix, and for every nonzero z G C" the 
tensor {S + A)z^p ^ 0. 

For when 5 + A is a Gram matrix it may be written ^ gj ® with the collection of 
gj e 5P(R") linearly independent; and [S + A)z®p = ^(g^ • z»P)gj. 

The strategy, then, for showing that a positive definite sos is a coercive sos is to change 
the Gram matrix, preserving its psd property, in order to eliminate from the null space all 
2-dimensional subspaces of the form span{s, t] where s + it = z®p for nonzero z G C. 
In this way the point of view of this article is opposite that of some literature growing 
out of Hilbert's theorems on sums of squares. For example, the coercive result (II. 10b is 
achieved by eliminating the nontrivial null space altogether, i.e. showing that pd Gram 
matrices exist for those cases. On the other hand, the most remarkable and difficult result 
of Hilbert's is that for the cone ^3,4, where the rank of a Gram matrix can be as large as 6, 
every polynomial can be written a sum of just 3 squares. Out of this came the general idea 
of the length or minimum number of squares required for an sos representation and out of 
this the Pythagoras number, the minimum number of squares needed over a collection of 
sos polynomials. See, for example, IIBCR98I . IICLR95II . llPfi95]| . UPDO I II and others. 



= X + iy e C". Then formally using the binomial expansion 

m=0 ^ ^ 
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For coerciveness the length of an sos is often an undesirable number, and one naturally 
wishes to maximize the number of independent squares in a representation. That this is 
an interesting problem is shown here by demonstrating, in the case of a positive definite 
polynomial with psd representation (Gram) matrix, that the rank of its Gram matrices 
cannot in general be increased enough to achieve the desired end, vis. coerciveness. 

We end this section by restating question (II. Il l in multilinear language and by outlining 
the construction by which the answer is shown to be no in general. 

Suppose Q e S'^{S^{MJ^)) is a Gram matrix and Qx^p ^ fi^r all rank-one tensors . 
Does there exist a change A such that C/ + A /i a Gram matrix and {Q + A)z'*^ ^ 0/or 
all nonzero z Cz C7 

Or less precisely, can a Gram matrix Q that is pd on the rank-one tensors be changed to 
be a Gram matrix that is pd on all subspaces of the form span{s, t} where s + it = z'^^ 
for some nonzero z e C? 

The question is answered below in the negative, for the cases n > 6, p = 2 and n > 4, 
p = 3, by the construction 

(2.19) Construct a Gram matrix Q such that 

(i) Q is positive definite on the rank-one tensors. 

(ii) there exists a nonzero z G C" such that the tensor Qz^p = 0. 
(iii) ^ + A is never a Gram matrix whenever Az®p 7^ 0. 
A uniqueness condition stronger than (iii) is 

(iii)' Q + Ais never a Gram matrix whenever A 7^ 0. 

3. A POSITIVE DEFINITE QUARTIC WITH A UNIQUE GRAM MATRIX 

In this section an element of is constructed that satifies (i) and (iii)' of the con- 
struction ( 12.191 ). but not (ii). 

The vector space of representation matrices S'^(S'p(M")) inherits a topology from the 
Euclidean space of the same dimension. The closed cone of Gram matrices will have as its 
interior the cone of positive definite Gram matrices. The boundary of this cone is the set 



Part (ii) of the construction ( 12.191 ) cannot be realized if Q is taken in the interior of the 
cone. Thus Q must be on the boundary if one hopes to realize (ii) and one is led to consider 
pd polynomials of degree 2p that border those that are not sums of squares. Historically 
pd and psd polynomials that are not sos are difficult to locate. It is therefore sensible to 
begin with a known pd polynomial that is not sos, i.e. does not have a Gram matrix but 
is definite on the rank-one tensors, and perturb it in such a way so that one arrives at the 
boundary of the Gram matrices while maintaining the rank-one definiteness. Here we take 
n = 4,p = 2, let X G correspond to {w, x, y, z) and begin with the Choi-Lam quartics 
qri ( II. ni l. ( 11.12b . letting 77 increase until the quartic ( 11.131 ) is achieved. 

Except for the uniqueness of representation claim, all other claims made for ( 11.131 ) in 
Section[T]can be quickly proved. 

1. By expanding the right side of ( 11.131 ) and collecting terms the right side meets the 
definition of ( 11.121 ) if the coefficients on the x^y^ ^ y'^z'^ and z'^x^ terms equal 1. This 
occurs when 



of Gram matrices with rank less than 
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2. is a root of ~ + ^ = 0. 

3. y/fj^ must be chosen to be the smallest positive root, else 770 would not be the smallest 77 
that makes g.,, an sos. 

Since degree and dimension are low in this section, tensors Eq will be denoted by using 
only the entries of each multi-index as subscripts, as in Eijki instead of E(j ^ Thus 

E2000 • = xj = w^, etc. 

4. That rjo, as described in Claims 2 and 3, is the smallest 77 for which is an sos will 
follow once it is proved that 



(3.1) Q,fo — (E2000 ~ \/^(Eo2oo + E0020 + E0002)) 
2 



«i2 



«i2] 



[(3V??oEiioo — Eooil)**^ + (Sy^Eioio — Eqioi)®^ + (Sy^Eiooi — Eolio) 

is the unique Gram matrix Q for which q,-^, (x) = Q ■ x'^^x'*^. For if g,, were an sos for 
some rj < rjo, then 

(3.2) q^, = + (7/0 - +y^ + z^) ^q^ + {rjo - rj){{x^ - y^f + {V2xyf + z^) 

and the polynomial identity presents two different Gram matrices for . Letting be, 
by Proposition l2.3l a Gram matrix for q,p q,,,, now has both 



(3.3) Q„ + {m ~ v){K2m + <L + E|,02) 
and 

(3.4) Q„ + (770 - 77)((Eo2oo - Eoo2o)^' + "^Efiio + E® 02) 

as Gram matrices. They differ by A ~ (r/o — r]){2'Ei®^-^Q — E0200 ®s E0020) contradicting 
the uniqueness of Qj^q . 

Remark 3.1. In contrast, the identity 2a;* + 2y'^ — {x^ — y^)"^ + (x^ + y^)'^ suggests 
2Ef2oo + 2E|j|o and (E0200 - E0020)®' + (E0200 + E0020)®' which ai-e identical Gram 
matrices. The two polynomial expressions are said to be obtained from one another by 
orthogonal transformation. See Proposition 2.10 of fCLR95|, p. 108. It is for this reason 
that by themselves it is not clear that each of (13.31) or (13.41) differs from since Qr) is 
unspecified. 

5. That (7,jj, is coercive is seen by showing that the corresponding homogeneous system of 
four quadratic equations has no solution in C' \ {0}. One starts with assuming a solution 
(w, y, z) has one of its coordinates equal to zero, cases that can be quickly eliminated. 
Then, assuming a solution has all nonzero coordinates, one has by using the last three 
quadratics of ( 11.131 ). y^z — 3y/r]owxy = zx"^ etc., whence x^ = y"^ = z^, whence 
3y^|w| = |a:| by any of the last three quadratics. Then \w\'^ = 3y^|a;p by the first, 
whence ^Jrjo = \ which is not true by Claim 2. 

The only task remaining is to prove the uniqueness of the Gram matrix Q^^, . Before that 
is done a bit more will be said about finding (|1.13t . 

An initial choice of representation matrices for the forms is 

0,-1 — ■ii'2000 ' -^0110 I" -^0011 -^0101 
2 2 2 2 

— 2 (Eiloo ®s Eooii + Eioio ®s Eqioi + Eiooi ®s Eono) +^(Eo2oo + E0020 + ^0002) 
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The qrj are symmetric in x, y and z. As 77 increases, if Q becomes the first Gram matrix 
encountered so would be Q' where Q' is derived from Q by permuting the indices for a;, y 
and z. Averaging all such permutations would produce a first Gram matrix that was sym- 
metric in X, y and z. Therefore the symmetry in the choice of 5^ is no loss of generality, 
and we expect that if a Gram matrix uniquely represents a q^, then it will be symmetric in 
x, y and z. 

Arrange the basis elements E200O) • • ■ according to the linear order uP' ^ ^ y"^ ^ 
z^ < wx -< yz < wy -< zx < wz -< xy. Then the matrix for 5,, with respect to the basis 
(l24]lis 



/ 1 

-b 
-b 
-6 



(3.6) 



-b 

V 
a 
a 



-b 
a 
V 



-b 
a 
a 
V 



\ 



2b 



1 - 2a 



2b 



1 - 2a 



2b 



V 



l-2a / 



when the parameters a = b = 0. The unmarked entries are zero. 

The two parameters permit the addition of six changes in a way that also obey the 
symmetry considerations in x, y and z. The smallest value of rj that allows a choice of a 
and b so that each of the four block matrices becomes rank-1 and psd is the 770 defined 
above. The minimizing choices are a = r/o and b — ^Jrjo- 

There are, however, twenty independent changes A in 5*^(5^(11^^)) altogether Though 
the type of argument being given can be made rigorous and lead to a uniqueness proof 
for , we will instead present another argument which will also be elementary, but also 
clearly decisive while computationally not too long if Maple^*'10 is used. It is based on 
the observation 

Suppose Q is a Gram matrix. Then a necessary (but not sufficient) condition for Q + Ato 
be a Gram matrix is that A be psd on Null{Q), the null space of Q : S'p(IR") S'^'(IR"), 
i.e. for every t G Null{Q) it is necessary that A • tt > 0. 

Let iV be a nonempty subspace of S'p(M"). When 5 • tt > fails to hold for some 
t G while 5 • ss > for an s e A^, 5 is said to be not definite on A^. Thus 

(3.7) 

If f{x) — Q ■ x'^Px^P where Q is a Gram matrix and if every nonzero A € A^'P'" is 
not definite on Null{Q), then Q is the unique Gram matrix for f. 

This is in fact a statement about subspaces of S'p(]R") and the Gram matrices that can 
be supported on their orthogonal complements. Consequently 

Let N be a subspace of S^iW^) and {ti, . . . , t^} a basis for its orthogonal comple- 
ment M. Suppose every nonzero A € A^^p^"^ is not definite on N. Let T be any linear 
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transformation on M. Then Qt — (T(ti))'^^ + • • • + (T(tr))®^ is the unique Gram matrix 
for the SOS fx (x) = Qt ■ x®^x®p. The collection of all such fx is a convex cone of S]n.2p- 
The last statement follows because if Qt and Qjj are psd on M so is their sum which 
will be given by some Qv with the linear transformation y on M derived, for example, by 
using ( 12.17b . 

Remark 3.2. If, for example, / is the identity on M and U is an orthogonal transformation 
on M, then fi = fu- This is Proposition 2.10 of IICLR95I again. 

Given a subspace N C S'^(M") of dimension m the following steps will be carried 
out in order to prove that certain sums of squares, supported like the above /t on the 
orthogonal complement of N, have unique Gram matrices. 

1. Form a general linear combination t = ati + bt2 + ■ ■ ■ of the m basis elements of N. 

2. Apply each element A of a basis for A^ ^ " ( 12.141 ) to the general linear combination, as 
A • tt, yielding a set of homogeneous quadratic polynomials in the m variables a,b, . . . 



3. Thinking of each quadratic polynomial from Step 2 as a linear expression in the mono- 
mials a^jb"^, . . . ,ab,ac, . . . , be, bd, . . ., write the 



P 



2p 
2p 



V 2 

by ^ + 1 ^ coefficient matrix for these linear expressions. 

4. Bring the coefficient matrix of Step 3 to reduced row echelon form thereby obtaining a 
set of quadratic polynomials that is equivalent to the set of Step 2, i.e. each set of quadratics 
consists of only linear combinations of quadratics from the other 

5. Show that no nontrivial linear combination of the quadratics from Step 4 yields a definite 
or semi-definite quadratic in the m variables. 

Remark 3.3. Steps 1 through 4 can be thought of as supplying details for an algorithm 
designed to show a certain semi-algebraic set consists (here) of one point (the origin). See 
the second algorithmic step and the remark that follows on p. 101 of MPW98I . Here it is 
Step 5 that is uncertain. 

In the case of interest here, there are m — 6 variables a, b, c, d, e, f and the coefficient 
matrix is 20 x 21, more quadratic monomials than quadratic polynomials. 

To simplify calculation, M"* (and thus ( 13.11 )) is scaled in the variable w, replaced with 
Define 



T7 3/2 



Then ( 13.11 ) is a linear combination with positive coefficients of the tensors 

(3.8) (3E2000 ~ 7(Eo2oo + E0020 + Eooo2))'^^ , 

(Eiioo ^ Eooii)**^, (Eioio — Eoioi)**^, and (Eiooi — Eono)**^ 

when 7 = 70. By Claims 2 and 3 at the beginning of this section the estimate ^^770 < 
1/3 holds, whence < 70 < 1. Thus all assertions about ( |1.13t will hold once the 
following theorem is proved. 
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Theorem 3.4. Given any ^, < 7 < 1, and any choice of aj > 0, j — 1,2,3,4, the 
quartic form of M[w, x, y, z] 

(3.9) ai{3w^ — 7(2:^ + + z^))"^ + a2{wx — yz)"^ + az{wy — zx)'^ + ai{wz — xyY 
is coercive and has a unique Gram matrix. 

Proof. Coerciveness follows as for (7,,^ in Claim 5 at the beginning of this section. 

Fix any < 7 < 1 and denote by Q^^ any linear combination, with positive coefficients, 
of the tensors ( I3.8l l. A basis for the null space of is supplied by 

Eiioo + Eooii, Eioio + Eoioi, Eiooi + Equo, 7E2000 + E0200 + E0020 + E0002, 

E0200 — Eoo20j and E0200 — E0002 



as ( 12.31 ). ( 12.71 ) and ( 12. 8b show. A general linear combination of these is g = 2aEiioo + 
2aEooii + 26E1010 + 26E0101 + 2cEiooi + 2cEoiio + 7dE2ooo + (rf + e + /)Eo2oo + 
{d ~ e)Eoo20 + (d - /)Eooo2 

A basis for the changes A-^-^ divides into three sets depending on the number of multi- 
indices a with a! = 2 that are used to express a A. The first type has two such a as 
in ^ 

~ 2^°^"" E0020 
there are 6 of these altogether The second type uses one as in 

2E2OOO ®s EollO — -^UQO ®s EioiO 

There are 12 of these. Finally there are only 2 independent changes that use no a\ = 2. 
We will use 

11 11 

-Eiioo ®s Eooii — -Eioio ®s Eoioi and -Enoo Eqoii — -Eipoi Equo 



The last type was used implicitly in the initial choice (13.51 ). The first type was introduced 
by the parameters in ( I3.6l i. 

Keeping in mind that by ( 12.7b and ( 12.81 ) Eq, • E^ = ^ and computing A • gg we obtain 

a2-7d(d + e + /) 
62 - ryd{d ~ e) 
c2-7d(d-/) 
a2-(d-e)(d-/) 
62-(d + e + /)(d-/) 
c2-(d + e + /)(d-e) 

then 



7da — he 
jdb — ac 
^dc — ab 
{d + e + f)a ~ be 
ld + e + f)b-ac 
[d + e + f )c - ab 
{d — e)a — be 
(d — e)b — ae 
(d — e)c — ab 
{d — f)a — be 
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{d~f)b-ac 
{d — f)c — ah 

and then 

Linearly ordering the monomial squares in alphabetical order followed by the indefinite 
monomials in alphabetical order a^,b^, . . . , f^,ab,ac, . . . , af, be, ... , df, ef the 20 x 21 
coefficient matrix of Step 3 above is obtained. Passing to reduced row echelon form, a 
matrix that consists of a 20 x 20 identity matrix together with a 21st column with successive 
entries 

— — , 2, 2,0,0,0,0,0,0,0,0,0,0,0,0,0,0 
1 — 7 1 — 7 1 — 7 1 — 7 

is obtained. 

Thus an equivalent set of quadratic polynomials is 



(3.10) a2 + -^e/ 

1 -7 



62 



+ 1 




1 

7 




7 




+ 1 




1 

7 




1 




+ T 




7 



e2 + 2e/ 
P + 2e/ 

together with the collection of 14 indefinite monomials a6, ac, . . . ,df (ef not included). 
Precisely when < 7 < 1 is there no nontrivial linear combination of these that yields a 
definite or semi-definite quadratic polynomial. Thus uniqueness follows from (13.7b . □ 

More generally, the quartics i3.% are pd whenever 7 7^ and 7 7^ 1. When 7 < 0, 
expanding the first square makes it transparent that the quartics ( 13. 9b have positive definite 
Gram matrices and are thus coercive sos. When 7 > 1 it is not clear in this way, but it is 
clear from (13.10b that there is a A that is positive definite on the null space of the (from 
the proof) that represents a ( 13. 9b . By taking e > small enough + eA will be pd by the 
proposition below. 

In some cases there only exist nontrivial A that are positive semi-definite on the null 
space of a psd Q. In those cases the proposition below gives necessary and sufficient con- 
ditions for Q + eA to be psd, i.e. for the associated sos to not have a unique Gram matrix. 
When Null{Q) n Null{A) ^ Null{Q) the propsition gives necessary and sufficient con- 
ditions for Q + eA to be psd with greater rank than Q. It provides conditions to build up 
the ranks of Gram matrices associated to an sos in an attempt to prove coerciveness of the 

SOS. 

The length of a vector x G M™ is denoted |x| and the operator norm of an m x m 
matrix B, as a transformation on K.'", is denoted \B\ — max|x|=i |-Bx|. 
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Proposition 3.5. Let A be real symmetric positive semi-definite m x m matrix. Let B 
be real symmetric m x m matrix that is psd on Null{A) C M™, i.e. z • _Bz > Q for all 

z e Null{A). 

Then for all e > small enough A + eB is a positive semi-definite matrix if and only if 
whenever zi G Null{A) and zi • Bzi — it follows that Bzi = 0. 

In the case A + eB is psd Null{A + tB) C Null{A) for all e > small enough, with 
strict containment when z • Bz does not vanish for every z G Null{A). 

If B is pd on Null{A) then A + eB is pdfor all e > small enough. 

Proof. A and B are assumed nontrivial. The last statement is proved first. 

Let a > be the smallest nonzero eigenvalue of A. Let 6 > be the smallest number 
satisfying z • Bz > 6|zp for all z G Null{A). Each x G has a unique decomposition 
X = y + z where z G Null{A) and y is orthogonal to Null{A), i.e. by the symmetry of 
A, each y is a sum of the eigenvectors of A that have positive eigenvalues. Thus 

(3.11) X • (A + eS)x = y ■ Ay + ey ■ By + 2ey ■ Bz + ez ■ Bz > 

a\y\^ - e\B\\y\^ - 2e\B\\y\\z\ + eb\z\' 

For X 7^ this last quantity will always be positive for any e satisfying < e < fsp^r&iBi ' 
proving the positive definiteness of A + eB. 

Now assume B is psd on Null{A). The first conclusion is proved next. 

Assume for some e > OthatA+eSispsd. Letzo G A^mZ/ (A) and assume Zq -Szo = 0. 
Thus zo-(A+ei?)zo = 0. Since A+eB has a psd square root it follows that (A+ei?)zo — 
whence Bzq = 0. 

For the other direction and for each x G M™, with x = y + z as before, the equality 
in ( 13. lit is again obtained. Each z G Null{A) has a unique decomposition z = zq + zi 
where zo G N ull {A) C^ N ull {B) wdzi G Null{A) is orthogonal to Null{A)nNull{B). 
In the event Null{A) n Null{B) = Null{A) it follows that z = zq and dTTTT ) yields 
X • (A + eB)yi > a|yp — e|i?||yp > for every x if e is small enough, with vanishing 
occurring only when x G Null{A). Otherwise there is a smallest number 6i > such that 
zi • Bzi > hi |zi |2 for all Zi G Null{A) orthogonal to Null{A) n Null{B). This follows 
by the hypothesis, zi • Bzi = implies Bzi = 0, whence zi G Null{A) n Nidl{B) 
whence zi ~ 0. Consequently z may be replaced by Zi and h by bi in (13.111 1. For all 
X ^ Null{A) n Null{B) and e > small enough ( 13.1 lb is then positive, completing the 
proof of the first conclusion. 

It has been shown for e > small enough that positivity of ( 13.111 ) fails only when 
X G Null{A) n Null{B), proving the second conclusion. 

□ 

/100\ /060\ 
Example 3.6. A = i is psd and S = 6 is psd on Null{A), 

\000/ \001/ 
but whenever b ^ Q and e ^ Q A + eB is not psd. 

This phenomenon persists when the B are specialized to represent changes A. Consider 
the coercive sos in noncoercive representation (x^ + y^)^ + + y^z^ + x^z^, i.e. with 
Gram matrix A = (E200 + + Eg,^ + Ef^^ + Ef^^ . Then A = E002 <E>s Eno - 

Eon (X)s Eioi is trivially psd on Null{A), but A + e/S.is not psd unless e = 0. Here 
A • EiioEiio = while AEno = 5Eoo2- 
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4. Proof of Theorem 1 1.41 
Theorem ll.4l follows from the next theorem. 
Theorem 4.1. Given 7, < 7 < 1 /3, the positive definite quarticform o/M[u, w, w, a;, z] 

/ = [v? -^v^ + vw)'^ + [w^ — 7(x^ + + z^))^ + [wx — yz)"^ + [wy — zx)^ + [wz — xyY 
is a noncoercive sum of squares. 

Proof. The last four terms sum to a pd form over R'' as shown in the last section. From 
this, positive definiteness over follows. On the other hand (1, i, 0, 0, 0, 0) G is a root 
for each of the five squared quadratics, i.e. the real and imaginary parts of 

(4.2) (e^ + ie^)^^ ^ E200000 - E020000 + 2iEiioooo := r + iq 

are in the null space of the Gram matrix Qq that gives representation (14.11 ) for /. Using 
( 12.18b . noncoerciveness of / will be proved by showing that every Gram matrix for / 
contains r and q ( 14.2b in its null space. 

Denote Ai = -5E200000 <^s E020000 + Efj^oopg. Then 

(4.3) Ai • rr = Ai • qq = 1 
There is a basis 

(4.4) {Ai,A2,...,Aio5} 
for ^2,2,6 ^jjjj ( 14.31 ) as its first member so that 

(4.5) Aj • rr = Aj • qq = 

for all j — 2,3,..., 105. This follows because the basis elements of ( 12.151 ) E^ E^ — 
Eq/ a + P — a' + /3', permit one of the equalities in ( 14.5b not to hold only when 

either both and E^ are contained in {E200000, E020000, Enoooo} or both Eq- and E/j- 
are contained. The only basis element Uke this is ±Ai. 

Remark 4.2. This relationship between a z'*^, z e C", and some basis for A^'^'" is gen- 
eral. The uniqueness does not quite hold in A^'^'"^, p > 3, however For example, both 
Efj^ — 5E21 Eo3 and Efj^ — ^Ei2 E30 are nonzero as quadratic forms on the real 
and imaginary parts of (e^ + ie^)^^. 

If Ai is removed from the basis (14.4b and A is taken in the subsequent span so that 
Qo + A is a Gram matrix, Proposition l3.5l and ( 14.51 ) then imply that r and q will also be in 
the null space of + A . Together with ( 14. 3b this implies 

(4.6) 

Any linear combination A of basis elements ( 14.4b , for which Qo + A is a Gram matrix 
and for which at least one of r or q is not in the null space of Qq + A, must have a 
positive coefficient on Ai. 

Hence let (5 > and consider the following principal submatrix of Qq + 2(5Ai where the 
order Equooo < E200000 -< E020000 < E002000 -< E000200 < Enoooo -< Eioiooo -< 
Eoioioo -< Eiooioo (i-e. vw < ~< v'^ < w'^ ~< < uv ^ uw ~< vx ~< ux) has been 
chosen, and a~h — c — d — e = Q. Blank entries are zero. 
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(4.7) 



/ l + 2a 1- 



1-S 



1-S -d 



-1 7 



25 

c 



c 

2d 



-2b 



V 



-2e / 



The following notation for principal submatrices of ( 14.7b will be used. [1 3] denotes the 
I + 2a 1 , from the 1st and 3rd rows and columns of i4.7\ . etc. 



submatrix , ^ ^ 

The parameters a,b,c,d,e correspond to the changes 2E^]^]^qqq 



E020000 ®s El 



002000, 



Eioiooo 

n«>2 



— E200000 '9s EolloOO, 
100100 respectively. 



2E' 



«i2 

101000 



E020000 ®s E000200 — 2EQ^Qj^Qg, Eiioooo '9s 
E200000 '9s E002000, E200000 '9s E000200 ~ 2E!j^ 

No other nonzero entries may be altered: The four I's in [1 2 3] because there is no 
basis element of A^'^'^ that is expressed using these positions. The three entries with 5 
because the only change possible has already been chosen. [4 5] because the quartic form 
/(O, 0, w, z) has a unique Gram matrix by Theorem 13. 41 and [4 5] is a submatrix of 
that Gram matrix; if a C^o + ^ is a Gram matrix, then by deleting all rows and columns that 
involve the variables u and v one obtains a Gram matrix for /(O, 0, w, x, y, z). 

When a = c = it follows that dct[l 2 3] = — (5^ < 0. Since all principal minors 
of a psd matrix must be nonnegative, a = c = cannot hold. It will first be shown that 
a = is necessary and then that c = (5 is necessary, leading to a contradiction that proves 
the theorem . 

The determinant of [1 3] forces a > 0. Introducing h, det[3 4 5] = — (6 — 07)^ whence 
& = 07 > 0. But submatrix [8] implies 6 < whence a = also. 

With a = it follows that dct[l 2 3] -((5 - c)^ whence c = 5. Consequently [6 7] 
requires d > 0. Now det[2 4 5] = — (e — 1^7)^ whence e > contradicting submatrix [9]. 

□ 



5. A 6th order example 

Consider the family of sextics 
(5.1) 

c t \ 2/22, 2 ^ 2\2 I 2^ 2 2 I 2 ^ 2\2 ,2/22, 2 ^ 2\2 

fp[x,y,z) [p X +py - -^z ) +y [p y + pz - i^x ) +z [p z + px - -y ) 

The three cubic polynomials that are squared have a common nontrivial root only when 
p = 0, = — i, = — 5+3V3 Qj. p3 _ -5+3V3 ^ g^^jj ^^^g jj^g j.^^^ taken in 

R^. Thus fp is pd if and only if p^ does not take the four listed values. In addition, every 
pd form fp is coercive. 

Put ?7o = (1 + V^)^^- Then for the Choi-Lam sextics ( |1.12| l. s^^ = (1 + \/h)fp when 
p = (1 + \/5)^^. It will be shown that (1 + \/5)^ belongs to an interval of p's for which 
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the fp have unique Gram matrices. This uniqueness impHes, as in the quartic case, that rjo 
is the smallest value of 77 for which is an sos. The identity used in (I3.2l l may be replaced 
with x'^ = {x^ - 2xy2)2 + (y3 _ 2x'^yY. 
Hence, an apparent Gram matrix Qp for each fp is 

(P^E300+pEi20— -Eio2)'*^ + (p^Eo30+pEoi2— -E2io)'*^ + (p^Eoo3+pE2oi — -£021)*^^ 

acting on the space 5'^(M"^) which has 10 dimensions. Therefore using Eq, • = ^ 
the null space for Qp is spanned by the vectors E300 — 3pEi2o, E120 + 2pEio2, E030 — 
3pEoi2, E012 + 2pE2io, Eooa - 3pE2oi, E201 + 2pEo2i and Em. A general linear com- 
bination is 

(5.2) g = aEaoo + 3(6 - pa)Ei2o + 6p&Eio2 + cEo3o + 3(d - pc)Eoi2 + QpdE2w 

+ eEoo3 + 3(/ - pe)E2oi + 6p/Eo2i + GgEin 

The 27 dimensions of the subspace A^'^'^ of changes may be briefly described as fol- 
lows. 



-E300 E120 — -E210 ®s E2 



is representative of 6 changes. 



is representative of 3. 



representative of 3. 



representative of 6. 



representative of 6. 



-E300 <Xis Em — -E210 ®s E201 



-E300 ®s Eo30 — 2-^210 ®s E120 



-E300 ®s E02I — 2E2OI ®s E120 



-E300 <Xis E021 — -E210 ®s Em 



1 1 

-E210 <Xis E012 — ®^ ^111 



representative of 3. Keeping in mind the examples E300 • E300 = l,Ei2o • E120 = 
1/3 and Em • Em = 1/6, and computing A • gg for each change yields the quadratic 
polynomials 

-pa? + ah~ Ap^d? 
-pc^ + cd- Ap^p 
-pe2 + e/ - 
2pab - p2e2 - p + 2pef 
2pcd - p^a^ -h"^ + 2 pah 
2pef - p^c^ -d^ + 2pcd 

ag — 2pdf + 2p'^de 
eg - 2pbf + 2p^af 
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eg - 2pbd + 2p'^bc 

ac + 2p^ad — 2pbd 
ce + 2p2c/ - 2pdf 
ae + 2p^be — 2pbf 

Zpaf — p^ae — bf + pbe 
3pbc — p^ac ^ bd + pad 
Sped — p^ce — df + pcf 
^pac + ad — Ap^bd 
—pee + cf — Ap^df 
—pae + be — Ap^bf 

af - dg 
be- fg 
de — bg 

—pac + ad + peg — fg 
-pee + cf + pag - bg 
—pae + be + peg — dg 

-2p'^cd + 2pd^ - g2 
-2p\f + 2pp - g" 
-2p^ab + 2pb'^ - g"^ 

Linearly order the 28 quadratic monomials a^ , b^ , . . . , g^ , ab, ac, . . . , ag, be, 
. . . , eg, fg as before and put the resulting 27 x 28 coefficient matrix into reduced echelon 
form. When the 26th column (the ef column) is removed the result is the identity matrix. 
Putting a = jijz^l^,T — jSi^ ^"d (f> = ^^^^f^^j-^j the 26th column has successive 
entries 

-a, -T, -a, -T, -a, -t, -cj), -1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, -1, 0, 0, 0, 0, 0, 0, 0, 
Thus an equivalent set of polynomials is 

(5.3) a^ - ere/ 

62 _ re/ 
- ere/ 
d2 - re/ 
e^ - ere/ 
P-ref 

9^ - (f>ef 
ab — ef 
cd — ef 

together with the remaining 18 indefinite monomials none of which appear in the poly- 
nomials ( 15.31 ). By ( 13.71 ) a sufficient requirement for fp to have a unique Gram matrix 
is that there exists no nontrivial linear combination of the polynomials ( I5.3l l that is a 
definite or semi-definite quadratic polynomial in the variables a, . . . ,g. This require- 
ment is equivalent to showing for a given p that every nontrivial choice of parameters 
A, B, C, D, E, F, G, J, K in 
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(5.4) 



2A 



J-Acr-Br 



\ 



J-Au-Bt 



2B 



2C 



K-Ccr~DT 



K-Cct-Dt 



2D 



2E 



-Ea-FT-G4>-J-K 



-Ecy-FT-G<j>-J-K 



2F 



\ 



2G ) 



produces an indefinite matrix. 

When cr, r and are not all of the same sign there exist, by ( 15.31 ). choices of positive 
A, . . . , G that make (15.4b definite. Lack of a common sign holds for — l/2<p'^<0 and 
1/16 < . When (? < —1/2 each of cr, r and (p is negative while each is positive for 
< p3 < 1/16. 

Restricting to those nontrivial choices with G = J = K = 0, all produce indefinite 
matrices ( 15.4b if and only if ctt > 1. For example, the 2x2 minor AEF — [Ea + 
Frf <Q if and only if err > 1 if and only if - < < -^+^^ , Therefore the 

remaining intervals for p^ for which all nontrivial ( 15.4b are possibly not definite are the 
open intervals (— 5+3V3 ^ ^j^^j ^-q^ -5+3\/3 That (j) shares the same sign with a and 
T in these intervals shows that choosing G > does not restrict these intervals further 
Neither can nonzero choices of J and K. The endpoints of the intervals yield fp that are 
not pd. 

The foregoing proves 

Theorem 5.1. The forms fp are pd and have unique Gram matrices if and only if — < 
p^ < or Q < p^ < ~5+3\/3 ^ other pd fp have Gram matrices of rank 10. Each fp, 
for p^ not equal to the endpoints of the above intervals, is coercive. 

(V5 + 1)"'^ is contained in the second interval. 

To prove Theorem ll.6l we will be content with a single example. Take p — —1. 
Theorem 5.2. The positive definite sextic form ofM.[w, x, y, z] 

(5.5) g{w, X, y, z) f-i{\/ + x^, y, z) = {w'^ + wx^ — wy^ — ^wz^Y 

I / 2 I 3 2 1 2\2 I / 3 2 1 2 ^ 2\2 , / 3 2 2 ^ 2\2 

+ [xw +x —xy ^2^^ ) +(y yz — ~2^^ +(z —zw — zx ^ i^^V ) 
is a noncoercive sum of squares. 

Proof. Let denote the apparent Gram matrix for g and let i denote the unique Gram 
matrix for 

For z e denote + — Then the precise relationship between com- 
mon complex roots for sos representations of g and /_i is Qqz®^ = if and only if 
, Z3, Z4)'^^ = 0. Consequently by Theorem l5.1l and ( 12.18b ^ = Z3 = Z4 = when 
^03 is in the null space of Qq. Thus z := (1, i, 0, 0) may be taken, up to scaling, as the 
only nontrivial common root in the sos representation ( 15.51 ) for g. 

For g to be coercive there must exist a A such that C7o + A is a Gram matrix and 
Az'*^ ^ ( 12.18b . Therefore, similarly to the quartic case, at least one of Ai = Ef2oo - 
iE2ioo <^s E0300 or A2 = Efj^oo - iEi2oo E3000 (see Remai-k|4!2l) must be included 
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in A with a positive coefficient. However, if A' is obtained from A by permuting the 1st 
and 2nd components of each multi-index of the basis elements (12.41 ). then Qo + A' would 
also be a Gram matrix because of the symmetry in w and x of ( I5.5l l. Further, because of 
positive semi-definiteness, z'^'^ is not in the null space of tJo + ^A + ^A' when it is not 
in the null space of Qq + A. Consequently, for g to be coercive, values for the parameters 
a, b, c, d with a + b + c + d = Oin 
(5.6) 

/ 1 + 2S 1-5 -1 + a \ 



1-6 1 -1 
1 + a -1 1 



1 + 2S 1-S -1 + b 
1-6 1 -1 
-1 + b -1 1 



1 -i -i 

^2 2 



111 

2 4 4 



2 4 ^ ^ 4 



must be found that make ( 15.6b psd when 6 > 0. Here ( 15.61 1 is the principal submatrix of 
Go + 2(5Ai + 26A2 corresponding to E1200 -< E3000 E1020 E2100 E0300 ^ 
E0120 -< E0030 -< E2010 -< E0210 -< Eiiio (i.e. wx^ ^ ^ wy^ -< w^x ^ x^ ^ 
j,y2 ^ y3 ^ yj2y ^ ^2y ^ ^ ^ ^Y^Q parameters represent the three changes E2100 ®s 

Eoi20~Eiiio(8)sEiiio, E2oio'8'sEo2io^Eiiio®sEiiio, and Ei2oo'8)sEio2o — Emo^s 

Eiiio- 

With the same notation as in the quartic case, the submatrix [2 3] of ( I5.6l l is fixed because 
it is a submatrix of the unique Gram matrix jF_i for g{w, 0, y, z) — f-i{w, y, z). So is 
[5 6] because (7(0, x, y, z) — f-i{x, y, z). In the same way [7 8] and [7 9] are fixed. With 
no other choices and det[l 2 3] = —(a — 5)^ it follows that a = 5 is forced. In the same 
way b = 5 and c = 0. Thus d — —25, a contradiction, and g cannot be coercive. 

□ 



Remark 5.3. By Nullstellensatze (see pp. 56-57 of l|Pfi95ll ) every collection of homoge- 
neous polynomials pi, . . . ,pr G C[a;i, . . . , x„] with 1 < r < n has a common nontrivial 
zero a G C" to the system of equations pi = ■ ■ ■ = Pr = while the corresponding 
statement, for the polynomial ring . . . , and R" in place of C", holds only when 
all of the degrees di, . . . ,dr of the polynomials pi, . . . ,pr are not even. Thus the sextic 
example here is required to be the sum of at least 4 squares in order to be pd while the 
quartic examples are pd with but 5 squares of quadratics in the 6 inde terminates. The 5 
quadratics necessarily share a nontrivial complex root while the 4 cubics need not, though 
they do. 



6. The game 

Starting with the collection of pd sos in P3 4 one can obtain the coercive result ( ll.lOl i 
without using Hilbert's theorem on ternary quartics by considering several generic cases. 
One shows that the ranks of the Gram matrices arising in each case can be built up by 
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adding changes A as delineated in Proposition |33] When attempting to show that the pd 
elements of E4.4 are coercive the number of cases is significantly higher. 

The vector space 5^(M" ) is isomorphic to the space of real symmetric n x n matrices 
by assigning t e ^^(K") to the matrix with the coordinates t • e*e^ as entries 1 < i, j < n. 
Every change A G A^'^'" yields a quadratic form A • tt that is a linear combination of 
the 2x2 minors of the symmetric matrix t. The argument of Section 3 that can show 
that a pd quartic with Gram matrix Q has Q as its unique Gram matrix amounts to showing 
that a general matrix in Null{Q) C 5^(M") has the property that every nontrivial linear 
combination of its 2 x 2 minors is indefinite. For example, the pd quartic 

f = {x\+ x\ - x\~ x'l)'^ + (2X2X3 - Xg + x\f + {xiX'i ~ X2Xi)'^ + {xi - 2:3)^X4 

has the basis 2Eiioo, E2000 — E0200, E2000 + E0200 + E0020 + E0002, E0020 — E0002 + 
2E0110; 2E1010 + 2E0101, 2E1001 + 2E0011 for the null space of its apparent Gram matrix. 
The first two basis elements are the imaginary and real parts of z(g)z for z — (1, i, 0, 0) the 
common complex root for the sos f. A general linear combination of the basis elements 
corresponds to the 4x4 matrix 

/ b+c a e / \ 



(6.1) 



t = 



-b + c d e 
d c + d f 



V / e / c-d/ 

Here, however, there is a nontrivial linear combination of the 2x2 minors that is not 
indefinite. Otherwise / would provide a noncoercive example for n = 4. To prove that / 
is coercive it is necessary to produce a linear combination of minors of the form 

(6.2) + b^ - + A-tt 

that is psd and where the last term does not include the principle 2x2 minor det[l 2]. 
This is the same observation as ( 14.6b . It might not be clear that the last term can be made 
up of minors that yield a positive coefficient on the monomial without introducing more 
indefiniteness. However, it can be done. To express itself as a linear combination of the 
remaining 19 independent 2x2 minors it is necessary to use 18 of them. In fact, ( 16.2b can 
be made pd and thus / possesses a Gram matrix of full rank by Proposition l3.5l 

By a linear change of variables in any nontrivial common complex root for a pd 
quartic sos may be taken to be (1, i, 0, 0, . . .). Therefore the precise setup of the principal 
submatrix [1 2] of ( 16.1b . together with the presence in some way of the variable c outside 
[1 2], is a typical setup for the null spaces of Gram matrices when trying to answer question 
dl.lb in the quartic cases. When c does not occur outside [1 2] real values may be assigned 
to the variables making [1 2] and t rank-1 matrices, contradicting the positive definiteness 
of the form /. When only a and b occur in [1 2] / can be written as a sos that includes the 
term {x1 + x'^)'^ = xf + 2x\x\ + x\ in the sum. 

These observations lead to the following diversion. 

1. Set up the principal submatrix [1 2] of an n x n symmetric matrix t exactly as in ( 16.11 ). 

2. Write linear combinations of c and a number of other real variables for the remaining 
entries. Variable c must be used while a and 6 may not. 
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3. The choices made in Step 2 are not allowed to result in a rank-1 matrix for any choice 
of real variable values. This can usually be checked by inspecting for zeros an sos quartic 
form, i.e. Gram matrix, which will have the n x n matrix as its null space. 

4. Search for a linear combination of 2 x 2 minors (not including det[l 2]) which when 
added to + — results in a psd quadratic form. 

When n = 4 or 5 there are two or three ways to win this game. Find a setup for which 
the goal of Step 4 cannot be achieved. Or, when Step 4 does result in a psd quadratic 
but never a pd quadratic, show that the resulting change A always satisfies At ^ for 
some choice of real variable values. See Proposition 13.51 Or, prove that neither of these 
outcomes is ever possible for any t constructed according to Steps 1, 2 and 3, thus proving 
that every pd sos is coercive. 

7. Final remark on coercive integro-differential forms 

The results of this article when combined with the Aronszajn-Smith Theorem show that 
there exist homogeneous constant coefficient elUptic operators L with formally positive 
integro-differential forms ( 11.21) for which a coercive estimate like (ll.3l l is never true. How- 
ever, such an L could have an integro-differential form like (11.4b which is not formally 
positive but which satisfies the coercive estimate ( 11.3b when ( 11.4b is used on the left side 
in place of ( 11.2b . The author claims this to be always true in the quartic, i.e. 4th order 
operator, cases. The proof necessarily uses Agmon's characterization of coerciveness and 
will appear elsewhere. Thus Agmon's characterization is needed in order to answer the co- 
erciveness problem for differential operators even when those operators possess formally 
positive integro-differential forms. 
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